Re: 22303-1 
{1st Draft) 

BLADE SERVER PERFORMANCE MANAGEMENT METHOD AND SYSTEM 

BACKGROUND OF THE INVENTION 

1. Field of the Invention: 

This invention relates to computer network server technology, and more 
particularly, to a blade server performance management method and system, which is 
designed for use with a blade server including a cluster of server modules to provide a 
performance management function on the clustered server modules with the purpose of 
allowing all the clustered server modules to have sufficient power load to operate normally 
as well as protecting each of the clustered server modules from being burned out due to an 
overly high working temperature during high-performance operation. 

2. Description of Related Art: 

Blade server is a clustering type of network server, which is characterized by the 
use of a circuit board enclosure to integrate a cluster of server modules (commonly called 
"blades"), with all of these server modules providing the same server function. In other 
words, a blade server can respond to a client's request by linking any one of the clustered 
server modules to the client. In practice, each server module is embodied as a single circuit 
board (i.e., blade), which can be easily fitted to the blade server's enclosure to increase the 
blade server's client serving capacity. 

Fundamentally, each of the clustered server modules in a blade server is equipped 
with one or more CPUs (Central Processing Unit) for independent data processing, and 
each CPU typically comes with two or more operating modes of different performance 
levels that can be arbitrarily set by system or user to provide different data processing 
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Speeds. Different models of CPUs may offer different operating modes. For example, a 
certain type of CPU offers two operating modes, respectively called automatic mode and 
on-demand mode, wherein the automatic mode is higher in performance level than the 
on-demand mode; and another type of CPU offers two operating modes, respectively called 
5 automatic mode and throttling mode, wherein the automatic mode is higher in performance 
level than the throttling mode. Fundamentally, a high level of operating mode would 
provide a higher processing speed, but would notwithstanding consume more electrical 
power and cause higher working temperature in the CPU. 

In application, the clustered server modules in blade server are typically set to 

10 different CPU operating modes, so that some of them can provide higher data serving 
capacity and speed. However, since a server module with a higher CPU performance level 
also has a higher rate of power consumption, it may cause other server modules with a 
lower CPU performance to be underpowered and thus unable to operate normally. 

Moreover, since a server module with a higher CPU performance level consumes 

15 more electrical power, it would lead to a higher working temperature during operation. 
When a server module's CPU working temperature exceeds an overheating limit, it could 
cause the server module's CPU to burn out and thus fail to operate normally. 

SUMMARY OF THE INVENTION 

It is therefore an objective of this invention to provide a blade server performance 
20 management method and system that allows all of the clustered server modules in a blade 
server to have sufficient power load to operate normally. 
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It is another objective of this invention to provide a blade server performance 
management method and system that can protect high-performance server modules in a 
blade server from being burned out due to an overly high working temperature. 

The blade server performance management method and system according to the 
5 invention is designed for use with a blade server including a cluster of server modules, each 
server module having at least two different operating modes, for providing a performance 
management function on the clustered server modules in the blade server. 

The blade server performance management method and system according to the 
invention is based on the method steps of: performing a power-load detecting procedure to 
1 0 detect the current distribution of power load by the power supply of the blade server to the 
clustered server modules in the blade server; if the current distribution of power load to the 
clustered server modules is below a rated power level, performing an operating mode 
inspecting procedure to inspect the current operating modes of the server modules to find 
the highest-performance server module; and performing a power-initiated operating mode 
15 adjusting procedure to switch the highest-performance server module to a lower level of 
operating mode. 

The blade server performance management method and system according to the 
invention allow each of the clustered server modules in the blade server to have sufficient 
power load to operate normally as well as to protect each of the clustered server modules 
20 from being burned out due to an overly high working temperature during high-performance 
operation. The blade server performance management method and system according to the 
invention is therefore advantageous to use. 
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BRIEF DESCRIPTION OF DRAWINGS 

The invention can be more fully understood by reading the following detailed 
description of the preferred embodiments, with reference made to the accompanying 
drawings, wherein: 

5 FIG. 1 is a schematic diagram showing an object-oriented component model of the 

blade server performance management system according to the invention; and 

FIG. 2 is a flow diagram showing the method steps performed by the blade server 
performance management system according to the invention. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The blade server performance management method and system according to the 
invention is disclosed in full details by way of preferred embodiments in the following with 
reference to the accompanying drawings. 

FIG. 1 is a schematic diagram showing the object-oriented component model of the 
blade server performance management system (as the part enclosed in the dotted box 
indicated by the reference numeral 100). As shown, in application, the blade server 
performance management system of the invention 100 is integrated to a blade server 10 
which includes a cluster of server modules (commonly called "blades"), for example 5 
server modules 11, 12, 13, 14, 15 (note that 5 server modules are shown here only for 
demonstrative purpose, and in practice, the blade server 10 may include any number of 
server modules). The server modules 11, 12, 13, 14, 15 each include one or more CPUs (not 
shown) for data processing, and each CPU comes with two or more operating modes of 
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different performance levels that can be arbitrarily set by system or user to provide 
different data processing speeds. 

The CPU operating modes available in the server modules 11, 12, 13, 14, 15 may be 
different in different models. For example, a certain type of CPU may offer two operating 
modes, respectively called automatic mode and on-demand mode, v/herein the automatic 
mode is higher in performance level than the on-demand mode; and another type of CPU 
may offer two operating modes, respectively called automatic mode and throttling mode, 
wherein the automatic mode is higher in performance level than the throttling mode. 
Fundamentally, a higher level of operating mode would provide a higher processing speed, 
but would notwithstanding consume more electrical power and cause higher working 
temperature in the CPU. 

In application, the clustered server modules 11, 12, 13, 14, 15 in the blade server 10 
are typically set to different CPU operating modes, so that some of them can provide higher 
data serving capacity and speed. For example, the first two server modules 11, 12 can be set 
to automatic modes, while the next two server modules 13, 14 are set to on-demand modes 
and the last server module 15 is set to throttling mode. 

The object-oriented component model of the blade server performance 
management system of the invention 100 comprises: (a) a power-load detecting module 
110; (b) a operating mode inspecting module 120; and (c) an operating mode adjusting 
module 130; and further comprises a temperature inspecting module 140. 

The power-load detecting module 110 is capable of detecting whether the current 
distribution of power load by the power supply 20 of the blade server 10 to the clustered 
server modules 11, 12, 13, 14, 15 in the blade server 20 is below a rated power level; and if 
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YES, the power-load detecting module 110 will promptly issue a power-initiated mode 
down request MODEJDOWN_PW to the operating mode adjusting module 130. 

The operating mode inspecting module 120 is capable of inspecting what operating 
mode is currently being set to each of the clustered server modules 11, 12, 13, 14, 15 in the 
blade server 10 to find which one or ones are currently being set in the highest-level of 
operating mode. The inspected information is then transferred to the operating mode 
adjusting module 130. 

The operating mode adjusting module 130 is capable of being activated in response 
to the mode down r^qncsi MODE _DOWN_PW from the power-load detecting module 110 
and based on the inspected information from the operating mode inspecting module 120 to 
adjust the operating mode of the highest-performance one of the server modules 11, 12, 13, 
14, 15 in the blade server 10 to a lower level of operating mode (if two or more server 
modules are operating at the same highest-level of operating mode, then one of them is 
selected). For example, if the highest-performance server module is currently set in 
automatic mode, then its operating mode is switched to on-demand mode or throttling 
mode which is a low-performance operating mode. After this, the power-load detecting 
module 110 will continue to detect whether the subsequent distribution of power load by 
the power supply 20 of the blade server 10 to the clustered server modules 11, 12, 13, 14, 
15 in the blade server 20 is above the rated power level; and if still below, the power-load 
detecting module 110 will issue another mode down request MODEJDOWNJPW to the 
operating mode adjusting module 130 to activate the operating mode adjusting module 130 
to switch another highest-performance server module tp a lower level of operating mode. 
This procedure will be repeated until the power-load detecting module 110 detects that the 
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distribution of power load by the power supply 20 of the blade server 10 to the clustered 
server modules 11, 12, 13, 14, 15 in the blade server 20 is above the rated power level. 

The temperature inspecting module 140 is capable of inspecting whether the 
current working temperature of any one of the clustered server modules 11, 12, 13, 14, 15 
in the blade server 10 is below an overheating limit; and if one or more server module are 
above the overheating limit, the temperature inspecting module 140 will promptly initiate a 
temperature lowering procedure by issuing a temperature-initiated mode down request 
MODEJ)OWNJTEMP to the operating mode adjusting module 130 to activate the 
operating mode adjusting module 130 to switch the currently overheated server module(s) 
to a lower level of operating mode. After this, the temperature inspecting module 140 will 
continue to inspect the subsequent working temperature of the previously overheated 
server module to see if it is now below the overheating limit; and if still not, the 
temperature inspecting module 140 will activate the operating mode adjusting module 130 
to further adjust the overheated server module to a still lower level of operating mode (if 
available). This procedure will be repeated until the working temperature of the overheated 
server module is below the overheating limit. Alternatively, in another preferred 
embodiment, the temperature inspecting module 140 can directly issue a TCC enable 
signal TCC_ENABLE to the overheated server module to activate the CPU on the 
overheated server module to perform a built-in TCC (Thermal Control Circuit) procedure 
to lower its working temperature by reducing performance, such as by reducing the duty 
cycles of the CPU clock signal to 30%-50%. 

FIG. 2 is a flow diagram showing the method steps performed by the blade server 
performance management system of the invention 100 during operation. 
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Referring to FIG. 2 together with FIG. 1, the initial step SIO is to perform a 
power-load detecting procedure, wherein the power-load detecting module 110 is activated 
to detect whether the current distribution of power load by the power supply 20 of the blade 
server 10 to the clustered server modules 11, 12, 13, 14, 15 in the blade server 20 is below 
5 a rated power level; If YES, the power-load detecting module 110 promptly issues a mode 
down request MODE_DOWN_PW to the operating mode adjusting module 130, and the 
procedure goes to the next step Sll. 

The next step Sll is to perform an operating mode inspecting procedure, wherein 
the operating mode inspecting module 120 is activated to inspect what operating mode is 

10 currently being set to each of the server modules 11, 12, 13, 14, 15 in the blade server 10 to 
find which one or ones are currently being set in the highest-level of operating mode, and 
then transfer the inspected information to the operating mode adjusting module 130. The 
procedure then goes to the step 812. 

In the step S12, an operating mode adjusting procedure is performed, wherein the 

1 5 operating mode adjusting module 130 is activated in response to the mode down request 
MODEJDOWN_PW from the power-load detecting module 110 and based on the 
inspected information from the operating mode inspecting module 120 to switch the 
highest performance server module to a lower level of operating mode (if two or more 
server modules are operating in the same highest-level of operating mode, then one of them 

20 is selected). For example, if the highest-performance server module is currently operating 
in automatic mode, then its operating mode is switched to on-demand mode or throttling 
mode which is a lower level of operating mode. 
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After the task of step S12 is completed, the procedure goes back to the previous step 
SIO to repeat the power-load detecting procedure, wherein the power-load detecting 
module 110 continues to detect whether the subsequent distribution of power load by the 
power supply 20 of the blade server 10 to the clustered server modules 11, 12, 13, 14, 15 in 
the blade server 20 is above the rated power level; and if still below, the steps S11-S12 are 
repeated again. The looped steps S10-S12 are repeated until the power-load detecting 
module 110 detects that the distribution of power load by the power supply 20 of the blade 
server 10 to the clustered server modules 11, 12, 13, 14, 15 in the blade server 20 is above 
the rated power level. 

As the power load to all of the clustered server modules 11, 12, 13, 14, 15 in the 
blade server 10 is sufficient, the procedure then goes to the step S20 to perform a 
temperature inspecting procedure, wherein the temperature inspecting module 140 is 
activated to inspect whether the current working temperature of any one of the server 
modules 11, 12, 13, 14, 15 exceeds an overheating limit; if NO, the procedure is ended; 
whereas if YES, the procedure goes to the step S21. 

In the step S21, a temperature lowering procedure is initiated with the purpose of 
lowering the working temperature of the overheated server module(s). This temperature 
lowering procedure has two different embodiments. 

By the first embodiment, the temperature inspecting module 140 is activated to 
issue a mode down request MODE_DOWNjrEMP to the operating mode adjusting 
module 130 to activate the operating mode adjusting module 130 to switch the overheated 
server module(s) to a lower level of operating mode. After this, the temperature inspecting 
module 140 will continue to inspect the subsequent working temperature of the previously 
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overheated server module to see if it is now below the overheating limit; and if still not, the 
temperature inspecting module 140 will activate the operating mode adjusting module 130 
to switch the overheated server module to a still lower level of operating mode (if 
available). This procedure will be repeated until the working temperature of the overheated 
5 server module is below the overheating limit. 

By the second embodiment, the temperature inspecting module 140 directly issues 
a TCC enable signal TCC^ENABLE to the overheated server module to activate the 
overheated server module to perform a built-in TCC (Thermal Control Circuit) procedure 
to lower its working temperature by reducing performance, such as by reducing the duty 

10 cycles of the CPU clock signal to 30%-50%. 

After the task of step S21 is completed, the procedure goes back to the previous step 
S20 to repeat the temperature inspecting procedure, wherein the temperature inspecting 
module 140 continues to inspect whether the subsequent working temperature of the 
previously overheated server module(s) still exceeds the overheating limit; if NO, the 

15 procedure is ended; whereas if YES, the procedure goes to the step S21 to repeat the 
temperature lowering procedure again. The looped steps S20-S21 are repeated until the 
temperature inspecting module 140 inspects that the subsequent working temperature of 
the previously overheated server module(s) is below the overheating limit. 

In conclusion, the invention provides a blade server performance management 

20 method and system, which is designed for use with a blade server including a cluster of 
server modules to provide a performance management function on the clustered server 
modules in the blade server so as to allow all the clustered server modules in the blade 
server to have sufficient power load to operate normally as well as to protect each of the 
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clustered server modules from being burned out due to an overly high working temperature 
during high-performance operation. The blade server performance management method 
and system according to the invention is therefore advantageous to use. 

The invention has been described using exemplary preferred embodiments. 
However, it is to be understood that the scope of the invention is not limited to the 
disclosed embodiments. On the contrary, it is intended to cover various modifications and 
similar arrangements. The scope of the claims, therefore, should be accorded the broadest 
interpretation so as to encompass all such modifications and similar arrangements. 



